The Bureau of Transportation Statistics’ Statistical Disclosure Limitation Method for Tabular Data: a Review

نویسندگان

  • J. Neil Russell
  • James P. Kelly
  • Fred Glover
چکیده

Overview The United States Department of Transportation’s Bureau of Transportation Statistics (BTS) is developing its confidentiality policy which is based on its legislative mandate (49 U.S.C. 111(i)) to protect individually identifiable information. Because the field of statistical disclosure limitation (SDL) research is still evolving, BTS wants to take advantage of the latest SDL research in updating its confidentiality policy and practices. To this end, BTS has undertaken research that seeks to develop a new method to limit disclosures in complex, multi-dimensional (up to five) tables that contain a hierarchical structure. This research will also offer a new way of protecting confidentiality while at the same time increasing access to data. In this paper we present a review of the literature of the current disclosure limitation methods for tabular data. We identify, describe, and classify the methods. We briefly describe BTS’ current methods of disclosure limitation and identify reasons for change and enhancement. Finally, we propose a new method for tabular data disclosure control and a research agenda that includes phases for developing and implementing the new method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

United Nations Statistical Commission and European Commission Economic Commission for Europe Statistical Office of the Conference of European Statisticians European Communities (eurostat) Joint Ece/eurostat Work Session on Statistical Data Confidentiality Balancing Data Quality and Confidentiality for Tabular Data Invited Paper

1. Tabular data are the earliest form and remain a staple of official statistics data products. Familiar examples of tabular data products in official statistics include count data such as age-race-sex and other demographic data, concentration (or percentage) data such in financial or energy utilization statistics, and magnitude data such as total retail sales or air pollution data. Confidentia...

متن کامل

Software for tabular data protection.

In order for national statistical offices to maintain the trust of the public to collect data and publish statistics of importance to society and decision-making, it is imperative that respondents (persons or establishments) be guaranteed privacy and confidentiality in return for providing requested confidential data. Consequently, for most survey and census data, disclosure limitation techniqu...

متن کامل

Wp. 47 English Only United Nations Economic Commission for Europe (unece) Conference of European Statisticians European Commission Statistical Office of the European

Many statistical agencies nowadays operate or envision tools for ad hoc creation and visualization of aggregate tables, ideally with web-access facilities. Users should be able to easily create their own customized tables. However, especially with heavily skewed business data, disclosure control issues usually are a big obstacle in this context, hardly solvable by traditional methods like cell ...

متن کامل

On Invariant Post Randomization for Statistical Disclosure Control

In this paper, we investigate certain operational and inferential aspects of invariant PRAM (post randomization method) as a tool for disclosure limitation of categorical data. Invariant PRAMs preserve unbiasedness of certain estimators, but inflate their variances and distort other attributes. We introduce the concept of strongly invariant PRAM, which does not affect data utility or the proper...

متن کامل

Measuring Identification Risk in Microdata Release and Its Control by Post-randomization

Statistical agencies often release a masked or perturbed version of survey data to protect respondents’ confidentiality. Ideally, a perturbation procedure should protect confidentiality without much loss of data quality, so that released data may practically be treated as original data for making inferences. One major objective is to control the risk of correctly identifying any respondent’s re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002